Dataset statistics
| Number of variables | 17 |
|---|---|
| Number of observations | 71102 |
| Missing cells | 26294 |
| Missing cells (%) | 2.2% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 9.7 MiB |
| Average record size in memory | 143.6 B |
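The missing-cell percentage can be cross-checked from the other figures in this table: it is the number of missing cells divided by the total number of cells (observations times variables). A minimal check in Python, using only numbers copied from the table; the variable names are illustrative:

```python
# Figures copied from the "Dataset statistics" table above.
n_variables = 17
n_observations = 71102
missing_cells = 26294

# Every observation has one cell per variable.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

print(f"{total_cells} cells, {missing_pct:.1f}% missing")
```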
Variable types
| NUM | 10 |
|---|---|
| CAT | 7 |
anio has constant value "2019" | Constant |
colonia has a high cardinality: 1340 distinct values | High cardinality |
consumo_prom_no_dom is highly correlated with consumo_prom | High correlation |
consumo_prom is highly correlated with consumo_prom_no_dom | High correlation |
alcaldia is highly correlated with nomgeo | High correlation |
nomgeo is highly correlated with alcaldia | High correlation |
consumo_total_mixto has 8327 (11.7%) missing values | Missing |
consumo_prom_dom has 4820 (6.8%) missing values | Missing |
consumo_total_dom has 4820 (6.8%) missing values | Missing |
consumo_prom_mixto has 8327 (11.7%) missing values | Missing |
consumo_total_mixto is highly skewed (γ1 = 21.76535468) | Skewed |
consumo_prom_dom is highly skewed (γ1 = 74.81862948) | Skewed |
consumo_prom_mixto is highly skewed (γ1 = 43.60044406) | Skewed |
consumo_prom is highly skewed (γ1 = 43.38268186) | Skewed |
consumo_prom_no_dom is highly skewed (γ1 = 40.71654298) | Skewed |
consumo_total_no_dom is highly skewed (γ1 = 22.5073679) | Skewed |
gid has unique values | Unique |
consumo_total_mixto has 17715 (24.9%) zeros | Zeros |
consumo_prom_dom has 9861 (13.9%) zeros | Zeros |
consumo_total_dom has 9861 (13.9%) zeros | Zeros |
consumo_prom_mixto has 17715 (24.9%) zeros | Zeros |
consumo_total has 2451 (3.4%) zeros | Zeros |
consumo_prom has 2451 (3.4%) zeros | Zeros |
consumo_prom_no_dom has 8109 (11.4%) zeros | Zeros |
consumo_total_no_dom has 8109 (11.4%) zeros | Zeros |
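The Missing and Zeros percentages above appear to use all 71102 observations as the denominator, missing rows included. The sketch below shows that convention on a small hypothetical column (the helper name `missing_and_zero_share` and the toy values are assumptions, not part of the report), then recomputes the two consumo_total_mixto percentages from the counts listed above:

```python
def missing_and_zero_share(values):
    """Return (missing %, zero %), both relative to all observations."""
    n = len(values)
    missing = sum(v is None for v in values)
    zeros = sum(v == 0 for v in values if v is not None)
    return 100 * missing / n, 100 * zeros / n

# Hypothetical toy column, not taken from the dataset.
toy = [0.0, 36.0, None, 17.7, 0.0, None, 29.28, 0.0]
toy_missing_pct, toy_zero_pct = missing_and_zero_share(toy)

# Counts for consumo_total_mixto as listed in the alerts.
n_observations = 71102
report_missing_pct = 100 * 8327 / n_observations
report_zero_pct = 100 * 17715 / n_observations

print(f"{report_missing_pct:.1f}% missing, {report_zero_pct:.1f}% zeros")
```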
Reproduction
| Analysis started | 2020-09-30 21:17:30.399226 |
|---|---|
| Analysis finished | 2020-09-30 21:17:46.826647 |
| Duration | 16.43 seconds |
| Software version | pandas-profiling v2.9.0 |
| Download configuration | config.yaml |
consumo_total_mixto
Real number (ℝ≥0)
| Distinct | 24339 |
|---|---|
| Distinct (%) | 38.8% |
| Missing | 8327 |
| Missing (%) | 11.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 174.3599291 |
|---|---|
| Minimum | 0 |
| Maximum | 23404.44 |
| Zeros | 17715 |
| Zeros (%) | 24.9% |
| Memory size | 555.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 79.94 |
| Q3 | 233.32 |
| 95-th percentile | 660.779 |
| Maximum | 23404.44 |
| Range | 23404.44 |
| Interquartile range (IQR) | 233.32 |
Descriptive statistics
| Standard deviation | 312.663596 |
|---|---|
| Coefficient of variation (CV) | 1.793207864 |
| Kurtosis | 1419.360189 |
| Mean | 174.3599291 |
| Median Absolute Deviation (MAD) | 79.94 |
| Skewness | 21.76535468 |
| Sum | 10945444.55 |
| Variance | 97758.52424 |
| Monotonicity | Not monotonic |
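Several of the figures above are tied together by simple identities: the coefficient of variation is the standard deviation divided by the mean, the variance is the squared standard deviation, the IQR is Q3 minus Q1, and the sum is the mean times the number of non-missing values. A short consistency check, using only numbers copied from the tables for this variable (variable names are illustrative):

```python
import math

# Values copied from the statistics tables for this variable.
mean = 174.3599291
std = 312.663596
variance = 97758.52424
q1, q3 = 0.0, 233.32
minimum, maximum = 0.0, 23404.44
n_observations, n_missing = 71102, 8327

cv = std / mean                       # coefficient of variation
iqr = q3 - q1                         # interquartile range
value_range = maximum - minimum       # range
n_valid = n_observations - n_missing  # non-missing observations
total = mean * n_valid                # should match the reported Sum

print(f"CV={cv:.6f}  IQR={iqr}  range={value_range}  sum={total:.2f}")
```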
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 0 | 17715 | 24.9% | |
| 36 | 74 | 0.1% | |
| 17.7 | 61 | 0.1% | |
| 36.6 | 59 | 0.1% | |
| 18.3 | 54 | 0.1% | |
| 29.28 | 52 | 0.1% | |
| 57.96 | 50 | 0.1% | |
| 23.8 | 48 | 0.1% | |
| 43.32 | 47 | 0.1% | |
| 46.98 | 46 | 0.1% | |
| Other values (24329) | 44569 | 62.7% | |
| (Missing) | 8327 | 11.7% |
| Value | Count | Frequency (%) | |
| 0 | 17715 | 24.9% | |
| 0.12 | 1 | < 0.1% | |
| 0.24 | 4 | < 0.1% | |
| 0.27 | 3 | < 0.1% | |
| 0.35 | 4 | < 0.1% |
| Value | Count | Frequency (%) | |
| 23404.44 | 1 | < 0.1% | |
| 23058.9 | 1 | < 0.1% | |
| 23031.06 | 1 | < 0.1% | |
| 5979.71 | 1 | < 0.1% | |
| 5974.32 | 1 | < 0.1% |
anio
Categorical
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 555.5 KiB |
| 2019 |
|---|
| Value | Count | Frequency (%) | |
| 2019 | 71102 | 100.0% |
Frequencies of value counts
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Histogram of lengths of the category
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
nomgeo
Categorical
| Distinct | 16 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 70.2 KiB |
| iztapalapa | |
|---|---|
| gustavo a. madero | |
| cuauhtemoc | |
| benito juarez | |
| venustiano carranza | |
| Other values (11) |
| Value | Count | Frequency (%) | |
| iztapalapa | 10515 | 14.8% | |
| gustavo a. madero | 10058 | 14.1% | |
| cuauhtemoc | 7313 | 10.3% | |
| benito juarez | 6049 | 8.5% | |
| venustiano carranza | 5179 | 7.3% | |
| miguel hidalgo | 5110 | 7.2% | |
| coyoacan | 4947 | 7.0% | |
| azcapotzalco | 4216 | 5.9% | |
| alvaro obregon | 4140 | 5.8% | |
| iztacalco | 3469 | 4.9% | |
| Other values (6) | 10106 | 14.2% |
Frequencies of value counts
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Histogram of lengths of the category
Length
| Max length | 22 |
|---|---|
| Median length | 12 |
| Mean length | 12.43351804 |
| Min length | 7 |
consumo_prom_dom
Real number (ℝ≥0)
| Distinct | 52060 |
|---|---|
| Distinct (%) | 78.5% |
| Missing | 4820 |
| Missing (%) | 6.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 29.13238577 |
|---|---|
| Minimum | 0 |
| Maximum | 7796.41 |
| Zeros | 9861 |
| Zeros (%) | 13.9% |
| Memory size | 555.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 18.69054691 |
| median | 26.41424809 |
| Q3 | 36.24656251 |
| 95-th percentile | 59.39294171 |
| Maximum | 7796.41 |
| Range | 7796.41 |
| Interquartile range (IQR) | 17.5560156 |
Descriptive statistics
| Standard deviation | 64.56592495 |
|---|---|
| Coefficient of variation (CV) | 2.216293765 |
| Kurtosis | 7663.654738 |
| Mean | 29.13238577 |
| Median Absolute Deviation (MAD) | 8.738705357 |
| Skewness | 74.81862948 |
| Sum | 1930952.794 |
| Variance | 4168.758665 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 0 | 9861 | 13.9% | |
| 1.22 | 33 | < 0.1% | |
| 14.64 | 23 | < 0.1% | |
| 10.98 | 22 | < 0.1% | |
| 15.25 | 22 | < 0.1% | |
| 9.76 | 21 | < 0.1% | |
| 9.15 | 21 | < 0.1% | |
| 20.48 | 20 | < 0.1% | |
| 7.93 | 20 | < 0.1% | |
| 11.59 | 20 | < 0.1% | |
| Other values (52050) | 56219 | 79.1% | |
| (Missing) | 4820 | 6.8% |
| Value | Count | Frequency (%) | |
| 0 | 9861 | 13.9% | |
| 0.009999999776 | 1 | < 0.1% | |
| 0.02 | 1 | < 0.1% | |
| 0.12 | 2 | < 0.1% | |
| 0.1299999952 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 7796.41 | 1 | < 0.1% | |
| 7581.69 | 1 | < 0.1% | |
| 6073.459961 | 1 | < 0.1% | |
| 3726.5 | 1 | < 0.1% | |
| 3622.2 | 1 | < 0.1% |
consumo_total_dom
Real number (ℝ≥0)
| Distinct | 47051 |
|---|---|
| Distinct (%) | 71.0% |
| Missing | 4820 |
| Missing (%) | 6.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1186.263611 |
|---|---|
| Minimum | 0 |
| Maximum | 95060.69 |
| Zeros | 9861 |
| Zeros (%) | 13.9% |
| Memory size | 555.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 161.635 |
| median | 604.185 |
| Q3 | 1261.445 |
| 95-th percentile | 4027.52 |
| Maximum | 95060.69 |
| Range | 95060.69 |
| Interquartile range (IQR) | 1099.81 |
Descriptive statistics
| Standard deviation | 2771.038307 |
|---|---|
| Coefficient of variation (CV) | 2.33593805 |
| Kurtosis | 248.0413047 |
| Mean | 1186.263611 |
| Median Absolute Deviation (MAD) | 517.3 |
| Skewness | 12.52320362 |
| Sum | 78627924.68 |
| Variance | 7678653.301 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 0 | 9861 | 13.9% | |
| 1.22 | 37 | 0.1% | |
| 10.98 | 21 | < 0.1% | |
| 3.66 | 20 | < 0.1% | |
| 14.64 | 20 | < 0.1% | |
| 25.62 | 20 | < 0.1% | |
| 18.3 | 19 | < 0.1% | |
| 15.25 | 19 | < 0.1% | |
| 7.93 | 19 | < 0.1% | |
| 17.69 | 18 | < 0.1% | |
| Other values (47041) | 56228 | 79.1% | |
| (Missing) | 4820 | 6.8% |
| Value | Count | Frequency (%) | |
| 0 | 9861 | 13.9% | |
| 0.12 | 1 | < 0.1% | |
| 0.24 | 1 | < 0.1% | |
| 0.5 | 2 | < 0.1% | |
| 0.6 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 95060.69 | 1 | < 0.1% | |
| 94021.7 | 1 | < 0.1% | |
| 90078.44 | 1 | < 0.1% | |
| 83309.94 | 1 | < 0.1% | |
| 82689.38 | 1 | < 0.1% |
alcaldia
Categorical
| Distinct | 16 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 70.2 KiB |
| iztapalapa | |
|---|---|
| gustavo a. madero | |
| cuauhtemoc | |
| benito juarez | |
| venustiano carranza | |
| Other values (11) |
| Value | Count | Frequency (%) | |
| iztapalapa | 10515 | 14.8% | |
| gustavo a. madero | 10058 | 14.1% | |
| cuauhtemoc | 7313 | 10.3% | |
| benito juarez | 6049 | 8.5% | |
| venustiano carranza | 5179 | 7.3% | |
| miguel hidalgo | 5110 | 7.2% | |
| coyoacan | 4947 | 7.0% | |
| azcapotzalco | 4216 | 5.9% | |
| alvaro obregon | 4140 | 5.8% | |
| iztacalco | 3469 | 4.9% | |
| Other values (6) | 10106 | 14.2% |
Frequencies of value counts
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Histogram of lengths of the category
Length
| Max length | 19 |
|---|---|
| Median length | 12 |
| Mean length | 12.25522489 |
| Min length | 7 |
colonia
Categorical
| Distinct | 1340 |
|---|---|
| Distinct (%) | 1.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 189.3 KiB |
| centro | 1139 |
|---|---|
| agricola oriental | 837 |
| roma norte | 602 |
| moctezuma 2a seccion | 558 |
| jardin balbuena | 498 |
| Other values (1335) |
| Value | Count | Frequency (%) | |
| centro | 1139 | 1.6% | |
| agricola oriental | 837 | 1.2% | |
| roma norte | 602 | 0.8% | |
| moctezuma 2a seccion | 558 | 0.8% | |
| jardin balbuena | 498 | 0.7% | |
| doctores | 490 | 0.7% | |
| san felipe de jesus | 419 | 0.6% | |
| roma sur | 418 | 0.6% | |
| obrera | 418 | 0.6% | |
| agricola pantitlan | 417 | 0.6% | |
| Other values (1330) | 65306 | 91.8% |
Frequencies of value counts
Unique
| Unique | 2 |
|---|---|
| Unique (%) | < 0.1% |
Histogram of lengths of the category
Length
| Max length | 43 |
|---|---|
| Median length | 16 |
| Mean length | 16.86555934 |
| Min length | 4 |
consumo_prom_mixto
Real number (ℝ≥0)
| Distinct | 31911 |
|---|---|
| Distinct (%) | 50.8% |
| Missing | 8327 |
| Missing (%) | 11.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 50.63623377 |
|---|---|
| Minimum | 0 |
| Maximum | 11702.22 |
| Zeros | 17715 |
| Zeros (%) | 24.9% |
| Memory size | 555.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 33.45166667 |
| Q3 | 61.21654793 |
| 95-th percentile | 162.2529989 |
| Maximum | 11702.22 |
| Range | 11702.22 |
| Interquartile range (IQR) | 61.21654793 |
Descriptive statistics
| Standard deviation | 130.4086734 |
|---|---|
| Coefficient of variation (CV) | 2.575402309 |
| Kurtosis | 3263.991441 |
| Mean | 50.63623377 |
| Median Absolute Deviation (MAD) | 33.33333333 |
| Skewness | 43.60044406 |
| Sum | 3178689.575 |
| Variance | 17006.42209 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 0 | 17715 | 24.9% | |
| 36 | 58 | 0.1% | |
| 29.28 | 57 | 0.1% | |
| 36.6 | 53 | 0.1% | |
| 23.8 | 49 | 0.1% | |
| 25.62 | 48 | 0.1% | |
| 26.84 | 47 | 0.1% | |
| 18.92 | 45 | 0.1% | |
| 1.84 | 45 | 0.1% | |
| 11.6 | 45 | 0.1% | |
| Other values (31901) | 44613 | 62.7% | |
| (Missing) | 8327 | 11.7% |
| Value | Count | Frequency (%) | |
| 0 | 17715 | 24.9% | |
| 0.1199999973 | 2 | < 0.1% | |
| 0.1899999976 | 1 | < 0.1% | |
| 0.19 | 2 | < 0.1% | |
| 0.2399999946 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 11702.22 | 1 | < 0.1% | |
| 11529.44971 | 1 | < 0.1% | |
| 11515.53 | 1 | < 0.1% | |
| 5808 | 3 | < 0.1% | |
| 4919.04 | 1 | < 0.1% |
consumo_total
Real number (ℝ≥0)
| Distinct | 56015 |
|---|---|
| Distinct (%) | 78.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1695.847222 |
|---|---|
| Minimum | 0 |
| Maximum | 119726.94 |
| Zeros | 2451 |
| Zeros (%) | 3.4% |
| Memory size | 555.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 6.49 |
| Q1 | 340.9525 |
| median | 896.175 |
| Q3 | 1808.9025 |
| 95-th percentile | 5564.1965 |
| Maximum | 119726.94 |
| Range | 119726.94 |
| Interquartile range (IQR) | 1467.95 |
Descriptive statistics
| Standard deviation | 3555.697457 |
|---|---|
| Coefficient of variation (CV) | 2.096708601 |
| Kurtosis | 195.8775277 |
| Mean | 1695.847222 |
| Median Absolute Deviation (MAD) | 664.505 |
| Skewness | 10.99825971 |
| Sum | 120578129.2 |
| Variance | 12642984.41 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 0 | 2451 | 3.4% | |
| 3.05 | 70 | 0.1% | |
| 1.22 | 68 | 0.1% | |
| 3.66 | 42 | 0.1% | |
| 6.71 | 41 | 0.1% | |
| 1.83 | 40 | 0.1% | |
| 7.93 | 39 | 0.1% | |
| 6.1 | 36 | 0.1% | |
| 9.76 | 36 | 0.1% | |
| 4.88 | 36 | 0.1% | |
| Other values (56005) | 68243 | 96.0% |
| Value | Count | Frequency (%) | |
| 0 | 2451 | 3.4% | |
| 0.01 | 3 | < 0.1% | |
| 0.05 | 3 | < 0.1% | |
| 0.12 | 5 | < 0.1% | |
| 0.24 | 18 | < 0.1% |
| Value | Count | Frequency (%) | |
| 119726.94 | 1 | < 0.1% | |
| 117150.91 | 1 | < 0.1% | |
| 101035 | 1 | < 0.1% | |
| 95117.77 | 1 | < 0.1% | |
| 94078.2 | 1 | < 0.1% |
consumo_prom
Real number (ℝ≥0)
| Distinct | 62214 |
|---|---|
| Distinct (%) | 87.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 111.2173991 |
|---|---|
| Minimum | 0 |
| Maximum | 89691.77344 |
| Zeros | 2451 |
| Zeros (%) | 3.4% |
| Memory size | 555.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 3.867208333 |
| Q1 | 23.01013907 |
| median | 31.69381809 |
| Q3 | 45.48491686 |
| 95-th percentile | 188.219501 |
| Maximum | 89691.77344 |
| Range | 89691.77344 |
| Interquartile range (IQR) | 22.47477779 |
Descriptive statistics
| Standard deviation | 1069.949262 |
|---|---|
| Coefficient of variation (CV) | 9.620340614 |
| Kurtosis | 2599.541185 |
| Mean | 111.2173991 |
| Median Absolute Deviation (MAD) | 10.31349875 |
| Skewness | 43.38268186 |
| Sum | 7907779.51 |
| Variance | 1144791.422 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 0 | 2451 | 3.4% | |
| 1.22 | 62 | 0.1% | |
| 3.05 | 55 | 0.1% | |
| 4.27 | 43 | 0.1% | |
| 6.71 | 39 | 0.1% | |
| 4.88 | 38 | 0.1% | |
| 3.66 | 38 | 0.1% | |
| 1.83 | 38 | 0.1% | |
| 9.76 | 37 | 0.1% | |
| 7.93 | 36 | 0.1% | |
| Other values (62204) | 68265 | 96.0% |
| Value | Count | Frequency (%) | |
| 0 | 2451 | 3.4% | |
| 0.009999999776 | 1 | < 0.1% | |
| 0.01 | 2 | < 0.1% | |
| 0.05 | 2 | < 0.1% | |
| 0.05000000075 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 89691.77344 | 1 | < 0.1% | |
| 87179.61 | 1 | < 0.1% | |
| 80555.01 | 1 | < 0.1% | |
| 56873.96 | 1 | < 0.1% | |
| 54935.99 | 1 | < 0.1% |
consumo_prom_no_dom
Real number (ℝ≥0)
| Distinct | 37440 |
|---|---|
| Distinct (%) | 52.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 126.7601718 |
|---|---|
| Minimum | 0 |
| Maximum | 89691.77344 |
| Zeros | 8109 |
| Zeros (%) | 11.4% |
| Memory size | 555.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 6.2754167 |
| median | 19.28000034 |
| Q3 | 54.186875 |
| 95-th percentile | 333.6616663 |
| Maximum | 89691.77344 |
| Range | 89691.77344 |
| Interquartile range (IQR) | 47.9114583 |
Descriptive statistics
| Standard deviation | 1095.817805 |
|---|---|
| Coefficient of variation (CV) | 8.64481161 |
| Kurtosis | 2364.161672 |
| Mean | 126.7601718 |
| Median Absolute Deviation (MAD) | 16.85000034 |
| Skewness | 40.71654298 |
| Sum | 9012901.734 |
| Variance | 1200816.661 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 0 | 8109 | 11.4% | |
| 1.22 | 330 | 0.5% | |
| 1.83 | 290 | 0.4% | |
| 3.05 | 260 | 0.4% | |
| 4.27 | 216 | 0.3% | |
| 7.93 | 203 | 0.3% | |
| 3.66 | 202 | 0.3% | |
| 4.88 | 201 | 0.3% | |
| 6.1 | 193 | 0.3% | |
| 6.71 | 190 | 0.3% | |
| Other values (37430) | 60908 | 85.7% |
| Value | Count | Frequency (%) | |
| 0 | 8109 | 11.4% | |
| 0.009999999776 | 1 | < 0.1% | |
| 0.01 | 2 | < 0.1% | |
| 0.012 | 1 | < 0.1% | |
| 0.01499999966 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 89691.77344 | 1 | < 0.1% | |
| 87179.61 | 1 | < 0.1% | |
| 80555.01 | 1 | < 0.1% | |
| 56873.96 | 1 | < 0.1% | |
| 54935.99 | 1 | < 0.1% |
bimestre
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 69.5 KiB |
| 2 | |
|---|---|
| 3 | |
| 1 |
| Value | Count | Frequency (%) | |
| 2 | 23942 | 33.7% | |
| 3 | 23822 | 33.5% | |
| 1 | 23338 | 32.8% |
Frequencies of value counts
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Histogram of lengths of the category
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
consumo_total_no_dom
Real number (ℝ≥0)
| Distinct | 27336 |
|---|---|
| Distinct (%) | 38.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 436.0603092 |
|---|---|
| Minimum | 0 |
| Maximum | 119726.94 |
| Zeros | 8109 |
| Zeros (%) | 11.4% |
| Memory size | 555.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 10.98 |
| median | 54.055 |
| Q3 | 230.43 |
| 95-th percentile | 1695.6175 |
| Maximum | 119726.94 |
| Range | 119726.94 |
| Interquartile range (IQR) | 219.45 |
Descriptive statistics
| Standard deviation | 2126.152162 |
|---|---|
| Coefficient of variation (CV) | 4.875821343 |
| Kurtosis | 798.0749258 |
| Mean | 436.0603092 |
| Median Absolute Deviation (MAD) | 52.875 |
| Skewness | 22.5073679 |
| Sum | 31004760.1 |
| Variance | 4520523.018 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 0 | 8109 | 11.4% | |
| 1.22 | 402 | 0.6% | |
| 1.83 | 316 | 0.4% | |
| 3.05 | 302 | 0.4% | |
| 7.93 | 219 | 0.3% | |
| 1.18 | 217 | 0.3% | |
| 4.88 | 212 | 0.3% | |
| 4.27 | 212 | 0.3% | |
| 3.66 | 211 | 0.3% | |
| 6.1 | 195 | 0.3% | |
| Other values (27326) | 60707 | 85.4% |
| Value | Count | Frequency (%) | |
| 0 | 8109 | 11.4% | |
| 0.01 | 3 | < 0.1% | |
| 0.03 | 1 | < 0.1% | |
| 0.05 | 3 | < 0.1% | |
| 0.08 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 119726.94 | 1 | < 0.1% | |
| 117150.91 | 1 | < 0.1% | |
| 101035 | 1 | < 0.1% | |
| 89691.8 | 1 | < 0.1% | |
| 88204.37 | 1 | < 0.1% |
gid
Categorical
| Distinct | 71102 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.3 MiB |
| 71102 | 1 |
|---|---|
| 23703 | 1 |
| 23697 | 1 |
| 23698 | 1 |
| 23699 | 1 |
| Other values (71097) |
| Value | Count | Frequency (%) | |
| 71102 | 1 | < 0.1% | |
| 23703 | 1 | < 0.1% | |
| 23697 | 1 | < 0.1% | |
| 23698 | 1 | < 0.1% | |
| 23699 | 1 | < 0.1% | |
| 23700 | 1 | < 0.1% | |
| 23701 | 1 | < 0.1% | |
| 23702 | 1 | < 0.1% | |
| 23704 | 1 | < 0.1% | |
| 23712 | 1 | < 0.1% | |
| Other values (71092) | 71092 | > 99.9% |
Frequencies of value counts
Unique
| Unique | 71102 |
|---|---|
| Unique (%) | 100.0% |
Histogram of lengths of the category
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 4.843801862 |
| Min length | 1 |
indice_des
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 69.6 KiB |
| bajo | |
|---|---|
| popular | |
| alto | |
| medio |
| Value | Count | Frequency (%) | |
| bajo | 29248 | 41.1% | |
| popular | 16539 | 23.3% | |
| alto | 15516 | 21.8% | |
| medio | 9799 | 13.8% |
Frequencies of value counts
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Histogram of lengths of the category
Length
| Max length | 7 |
|---|---|
| Median length | 4 |
| Mean length | 4.835644567 |
| Min length | 4 |
latitud
Real number (ℝ≥0)
| Distinct | 22930 |
|---|---|
| Distinct (%) | 32.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 19.39227276 |
|---|---|
| Minimum | 19.13586653 |
| Maximum | 19.57910261 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 555.5 KiB |
Quantile statistics
| Minimum | 19.13586653 |
|---|---|
| 5-th percentile | 19.27217463 |
| Q1 | 19.34407317 |
| median | 19.39291026 |
| Q3 | 19.44681849 |
| 95-th percentile | 19.49744601 |
| Maximum | 19.57910261 |
| Range | 0.4432360842 |
| Interquartile range (IQR) | 0.1027453211 |
Descriptive statistics
| Standard deviation | 0.07054946408 |
|---|---|
| Coefficient of variation (CV) | 0.003638019377 |
| Kurtosis | -0.3299967947 |
| Mean | 19.39227276 |
| Median Absolute Deviation (MAD) | 0.05121505235 |
| Skewness | -0.2209675789 |
| Sum | 1378829.378 |
| Variance | 0.004977226881 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 19.49545978 | 21 | < 0.1% | |
| 19.50314865 | 21 | < 0.1% | |
| 19.44888183 | 21 | < 0.1% | |
| 19.51433121 | 21 | < 0.1% | |
| 19.30094699 | 21 | < 0.1% | |
| 19.51126852 | 21 | < 0.1% | |
| 19.51081678 | 21 | < 0.1% | |
| 19.41716896 | 13 | < 0.1% | |
| 19.49661646 | 13 | < 0.1% | |
| 19.51160136 | 12 | < 0.1% | |
| Other values (22920) | 70917 | 99.7% |
| Value | Count | Frequency (%) | |
| 19.13586653 | 3 | < 0.1% | |
| 19.13628997 | 3 | < 0.1% | |
| 19.16951445 | 2 | < 0.1% | |
| 19.1728973 | 3 | < 0.1% | |
| 19.173993 | 3 | < 0.1% |
| Value | Count | Frequency (%) | |
| 19.57910261 | 3 | < 0.1% | |
| 19.57503233 | 3 | < 0.1% | |
| 19.57456726 | 3 | < 0.1% | |
| 19.5718567 | 3 | < 0.1% | |
| 19.57141877 | 3 | < 0.1% |
longitud
Real number (ℝ)
| Distinct | 22930 |
|---|---|
| Distinct (%) | 32.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -99.13289588 |
|---|---|
| Minimum | -99.33770342 |
| Maximum | -98.95046917 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 555.5 KiB |
Quantile statistics
| Minimum | -99.33770342 |
|---|---|
| 5-th percentile | -99.2236241 |
| Q1 | -99.17248433 |
| median | -99.13519579 |
| Q3 | -99.09663337 |
| 95-th percentile | -99.02915715 |
| Maximum | -98.95046917 |
| Range | 0.3872342535 |
| Interquartile range (IQR) | 0.07585096428 |
Descriptive statistics
| Standard deviation | 0.05789023819 |
|---|---|
| Coefficient of variation (CV) | -0.0005839659749 |
| Kurtosis | 0.03317853179 |
| Mean | -99.13289588 |
| Median Absolute Deviation (MAD) | 0.0378663909 |
| Skewness | 0.1247230301 |
| Sum | -7048547.163 |
| Variance | 0.003351279677 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| -99.13756286 | 21 | < 0.1% | |
| -99.20421465 | 21 | < 0.1% | |
| -99.15821728 | 21 | < 0.1% | |
| -99.14369261 | 21 | < 0.1% | |
| -99.08903493 | 21 | < 0.1% | |
| -99.18589472 | 21 | < 0.1% | |
| -99.20751574 | 21 | < 0.1% | |
| -99.17071426 | 13 | < 0.1% | |
| -99.19306813 | 13 | < 0.1% | |
| -99.14127961 | 12 | < 0.1% | |
| Other values (22920) | 70917 | 99.7% |
| Value | Count | Frequency (%) | |
| -99.33770342 | 3 | < 0.1% | |
| -99.32799413 | 3 | < 0.1% | |
| -99.32592098 | 3 | < 0.1% | |
| -99.32544326 | 3 | < 0.1% | |
| -99.32502513 | 3 | < 0.1% |
| Value | Count | Frequency (%) | |
| -98.95046917 | 3 | < 0.1% | |
| -98.95128667 | 3 | < 0.1% | |
| -98.95334634 | 3 | < 0.1% | |
| -98.95408029 | 3 | < 0.1% | |
| -98.95769198 | 3 | < 0.1% |
Pearson's r
Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1: -1 indicates total negative linear correlation, 0 indicates no linear correlation, and +1 indicates total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r. To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
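A minimal sketch of this calculation in plain Python, assuming population (divide-by-n) covariance and standard deviations. The function name `pearson_r` and the toy data are illustrative and not part of the report. The two derived series also illustrate the invariance under changes in location and scale: any increasing linear transform gives r = +1 and any decreasing one gives r = -1, whatever the slope.

```python
import math

def pearson_r(x, y):
    """Covariance of x and y divided by the product of their standard deviations."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y)) / n
    std_x = math.sqrt(sum((a - mean_x) ** 2 for a in x) / n)
    std_y = math.sqrt(sum((b - mean_y) ** 2 for b in y) / n)
    return cov / (std_x * std_y)

# Hypothetical toy data.
x = [1.0, 2.0, 3.0, 4.0, 5.0]
y_up = [2 * v + 10 for v in x]      # increasing linear function of x
y_down = [-0.5 * v + 3 for v in x]  # decreasing linear function of x

r_up = pearson_r(x, y_up)
r_down = pearson_r(x, y_down)
```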
Spearman's ρ
Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better at catching nonlinear monotonic correlations than Pearson's r. Its value lies between -1 and +1: -1 indicates total negative monotonic correlation, 0 indicates no monotonic correlation, and +1 indicates total positive monotonic correlation. To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
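As a rough illustration, ρ can be computed by ranking both variables and applying the Pearson formula to the ranks. The sketch below assumes ties receive the average of the ranks they span; the helper names (`average_ranks`, `pearson_r`, `spearman_rho`) and the toy data are illustrative, not taken from the report. For a cubic relationship, which is monotonic but not linear, ρ comes out at 1 while Pearson's r stays below 1.

```python
import math

def average_ranks(values):
    """Rank values from 1..n, giving tied values the average of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks

def pearson_r(x, y):
    """Covariance divided by the product of the standard deviations."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)

def spearman_rho(x, y):
    """Pearson correlation of the rank variables."""
    return pearson_r(average_ranks(x), average_ranks(y))

# Hypothetical toy data: a monotonic but nonlinear relationship.
x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [v ** 3 for v in x]

rho = spearman_rho(x, y)
r = pearson_r(x, y)
```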
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1: -1 indicates total negative correlation, 0 indicates no correlation, and +1 indicates total positive correlation. To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
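The pair-counting definition above can be sketched directly in Python. This follows the formula as stated in the text, with no correction for ties (tied pairs count as neither concordant nor discordant); libraries such as SciPy default to a tie-corrected variant, so results can differ when ties are present. The function name `kendall_tau` and the toy data are illustrative.

```python
from itertools import combinations

def kendall_tau(x, y):
    """(concordant - discordant) / total number of pairs, without tie correction."""
    concordant = 0
    discordant = 0
    pairs = list(combinations(range(len(x)), 2))
    for i, j in pairs:
        # The sign of the product tells whether x and y move in the same direction.
        direction = (x[i] - x[j]) * (y[i] - y[j])
        if direction > 0:
            concordant += 1
        elif direction < 0:
            discordant += 1
    return (concordant - discordant) / len(pairs)

# Hypothetical toy data: two adjacent swaps out of ten pairs.
x = [1, 2, 3, 4, 5]
y = [1, 3, 2, 5, 4]

tau = kendall_tau(x, y)                  # 8 concordant, 2 discordant
tau_reversed = kendall_tau(x, x[::-1])   # fully reversed order
```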
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. Extensive documentation is available for it.
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been shown to be biased, even for large samples. We use the bias-corrected measure proposed by Bergsma in 2013.
First rows
| | consumo_total_mixto | anio | nomgeo | consumo_prom_dom | consumo_total_dom | alcaldia | colonia | consumo_prom_mixto | consumo_total | consumo_prom | consumo_prom_no_dom | bimestre | consumo_total_no_dom | gid | indice_des | latitud | longitud |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 159.72 | 2019 | gustavo a. madero | 42.566364 | 468.23 | gustavo a. madero | 7 de noviembre | 53.24000 | 631.00 | 42.066667 | 3.050000 | 3 | 3.05 | 57250 | alto | 19.455260 | -99.112662 |
| 1 | 0.00 | 2019 | gustavo a. madero | 35.936667 | 107.81 | gustavo a. madero | 7 de noviembre | 0.00000 | 115.13 | 28.782500 | 7.320000 | 3 | 7.32 | 57253 | medio | 19.455260 | -99.112662 |
| 2 | 0.00 | 2019 | gustavo a. madero | 24.586000 | 122.93 | gustavo a. madero | 7 de noviembre | 0.00000 | 197.96 | 32.993333 | 75.030000 | 3 | 75.03 | 57255 | popular | 19.455720 | -99.113582 |
| 3 | 0.00 | 2019 | gustavo a. madero | 0.000000 | 0.00 | gustavo a. madero | nueva tenochtitlan | 0.00000 | 253.53 | 84.510000 | 84.510000 | 3 | 253.53 | 57267 | bajo | 19.459647 | -99.104469 |
| 4 | 56.72 | 2019 | azcapotzalco | 67.436250 | 539.49 | azcapotzalco | prohogar | 56.72000 | 839.35 | 76.304545 | 121.570000 | 3 | 243.14 | 57330 | bajo | 19.474161 | -99.146750 |
| 5 | 439.77 | 2019 | azcapotzalco | 35.675769 | 927.57 | azcapotzalco | trabajadores del hierro | 54.97125 | 1399.67 | 37.828919 | 10.776667 | 3 | 32.33 | 57273 | bajo | 19.478613 | -99.150571 |
| 6 | 991.80 | 2019 | azcapotzalco | 22.381884 | 4633.05 | azcapotzalco | barrio coltongo | 123.97500 | 7693.64 | 33.305801 | 129.299375 | 3 | 2068.79 | 57275 | bajo | 19.480211 | -99.152316 |
| 7 | 0.00 | 2019 | azcapotzalco | 0.000000 | 0.00 | azcapotzalco | barrio coltongo | 0.00000 | 305.00 | 152.500000 | 152.500000 | 3 | 305.00 | 57276 | popular | 19.479096 | -99.148920 |
| 8 | 184.86 | 2019 | azcapotzalco | 33.661176 | 1716.72 | azcapotzalco | trabajadores del hierro | 46.21500 | 1903.66 | 33.993929 | 2.080000 | 3 | 2.08 | 57277 | bajo | 19.478585 | -99.148847 |
| 9 | 10.98 | 2019 | azcapotzalco | 51.912500 | 207.65 | azcapotzalco | trabajadores del hierro | 10.98000 | 237.54 | 29.692500 | 6.303333 | 3 | 18.91 | 57281 | bajo | 19.477273 | -99.147921 |
Last rows
| | consumo_total_mixto | anio | nomgeo | consumo_prom_dom | consumo_total_dom | alcaldia | colonia | consumo_prom_mixto | consumo_total | consumo_prom | consumo_prom_no_dom | bimestre | consumo_total_no_dom | gid | indice_des | latitud | longitud |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 71092 | NaN | 2019 | cuauhtemoc | 18.328530 | 623.17 | cuauhtemoc | doctores | NaN | 1148.67 | 22.973400 | 32.843751 | 1 | 525.50 | 219 | medio | 19.424418 | -99.144312 |
| 71093 | 226.04 | 2019 | cuauhtemoc | 13.793529 | 234.49 | cuauhtemoc | centro | 113.019997 | 718.31 | 19.953056 | 15.163530 | 1 | 257.78 | 221 | medio | 19.431082 | -99.143493 |
| 71094 | 53.74 | 2019 | cuauhtemoc | 24.466017 | 2886.99 | cuauhtemoc | guerrero | 26.870001 | 3052.04 | 21.342937 | 4.839565 | 1 | 111.31 | 231 | bajo | 19.449404 | -99.138990 |
| 71095 | 749.40 | 2019 | cuauhtemoc | 21.016237 | 7818.03 | cuauhtemoc | guerrero | 107.057143 | 8749.27 | 22.150025 | 11.363750 | 1 | 181.82 | 230 | bajo | 19.449682 | -99.140397 |
| 71096 | 68.68 | 2019 | cuauhtemoc | 22.584910 | 6301.20 | cuauhtemoc | guerrero | 68.680000 | 6690.78 | 23.394301 | 53.483335 | 1 | 320.90 | 234 | bajo | 19.449003 | -99.143680 |
| 71097 | 359.88 | 2019 | cuauhtemoc | 282.152488 | 1128.61 | cuauhtemoc | guerrero | 179.940002 | 1509.15 | 167.683328 | 6.886667 | 1 | 20.66 | 236 | bajo | 19.450079 | -99.144435 |
| 71098 | 401.32 | 2019 | cuauhtemoc | 25.021442 | 2777.38 | cuauhtemoc | guerrero | 100.330000 | 3318.63 | 27.426694 | 23.321667 | 1 | 139.93 | 240 | bajo | 19.448210 | -99.144851 |
| 71099 | 142.25 | 2019 | cuauhtemoc | 27.043654 | 1406.27 | cuauhtemoc | guerrero | 28.450001 | 1586.61 | 25.590484 | 7.618000 | 1 | 38.09 | 241 | popular | 19.447826 | -99.143819 |
| 71100 | 31.42 | 2019 | cuauhtemoc | 18.601529 | 3162.26 | cuauhtemoc | guerrero | 15.710001 | 3250.39 | 18.260618 | 9.451667 | 1 | 56.71 | 243 | bajo | 19.448187 | -99.142392 |
| 71101 | 976.12 | 2019 | cuauhtemoc | 23.949699 | 8765.60 | cuauhtemoc | guerrero | 162.686669 | 9858.46 | 26.011741 | 16.677143 | 1 | 116.74 | 246 | bajo | 19.447683 | -99.141193 |